A new 2-kbit/s speech coder based on normalized pitch waveform

نویسندگان

  • Yusuke Hiwasaki
  • Kazunori Mano
چکیده

Speech coding at very low bitrate is useful for purposes such as voice communication over computer networks. However, speech coding at around 2.0 kbit/s is di cult for CELP coders while maintaining a high quality. In this paper, a speech coding model called `normalized pitch waveform' and its quantization scheme are presented, aiming for effective compression coding of the `voiced' speech. Listening tests has proven that an e cient and high quality coding has been achieved at bitrate 2.0 kbit/s, less than half of the FS1016. Furthermore, this paper discusses the disadvantage of the normalized pitch waveform and presents an alternative method of using non-normalized pitch waveform. Encoding of a transitional `mixed' state between the `voiced' and the `unvoiced' state is discussed for further improvements.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Design of a toll-quality 4-kbit/s speech coder based on phase-adaptive PSI-CELP

This paper describes the design of a toll-quality 4-kbit/s speech coder based on phase-adaptive PSI-CELP. This adaptation method not only gives pitch periodicity to the random excitation but also synchronizes the basic point of the stored random vector with the pitch phase. We further improve the proposed coder by introducing a backward gain prediction scheme. In subjective evaluation experimen...

متن کامل

An 8 kbit/s ACELP coder with improved background noise performance

This paper describes an 8 kbit/s ACELP speech coder with high performance for both speech and non-speech signals such as background noise. While the traditional waveform matching LPAS structure employed in many existing speech coders provides high quality for speech signals, it has significant performance limitations for e.g. background noise. The coder presented here employs a novel adaptive g...

متن کامل

A Pitch Pulse Evolution Model for a Dual Excitation Linear Predictive Speech Coder

This paper introduces a new technique to model the excitation waveform for a linear predictive speech coder The target appli cation is high quality speech coding for rates near kb s Our pitch pulse evolution model decomposes the excitation into two separate but simultaneous signals the evolving pitch pulse com ponent and the unvoiced noise like contribution A number of formulations for decompos...

متن کامل

A 16-kbit/s wideband speech codec scalable with g.729

A wideband speech scalable codec is proposed for improving the flexibility in telecommunication networks. This coder is scalable with G.729 (ITU 8-kbit/s standard). Its decoder can process the incoming bitstream at three bit rates (8, 12, and 16 kbit/s) and provide a choice of speech types (wideband and telephone-band). The codec has a split-band structure, where both bands are coded by analysi...

متن کامل

A Pitch Pulse Evolution Model for a Dual ExcitationLinear Predictive Speech

This paper introduces a new technique to model the excitation waveform for a linear predictive speech coder. The target application is high quality speech coding for rates near 4 kb/s. Our pitch pulse evolution model decomposes the excitation into two separate but simultaneous signals: the evolving pitch pulse component and the unvoiced, noise-like contribution. A number of formulations for dec...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997